Search CORE

Empirical Bayes models for multiple probe type microarrays at the probe level

Author: A Hess
A Sjögren
A Spira
AM Hein
AP Dempster
B Efron
BP Durbin
BP Durbin
D Gaile
D Holder
DM Rocke
E Kristiansson
E Kristiansson
GK Smyth
I Lönnstedt
IA Eaves
J Comander
J Hu
JW Tukey
LM Cope
M Åstrand
MA Sartor
Magnus Åstrand
Mats Rudemo
N Jain
P Baldi
P Munson
Petter Mostad
R Opgen-Rhein
RA Irizarry
RS Stearman
S Choe
SC Geller
T Hastie
VG Tusher
W Huber
W Lemon
X Liu
X Liu
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background When analyzing microarray data a primary objective is often to find differentially expressed genes. With empirical Bayes and penalized t-tests the sample variances are adjusted towards a global estimate, producing more stable results compared to ordinary t-tests. However, for Affymetrix type data a clear dependency between variability and intensity-level generally exists, even for logged intensities, most clearly for data at the probe level but also for probe-set summarizes such as the MAS5 expression index. As a consequence, adjustment towards a global estimate results in an intensity-level dependent false positive rate. Results We propose two new methods for finding differentially expressed genes, Probe level Locally moderated Weighted median-t (PLW) and Locally Moderated Weighted-t (LMW). Both methods use an empirical Bayes model taking the dependency between variability and intensity-level into account. A global covariance matrix is also used allowing for differing variances between arrays as well as array-to-array correlations. PLW is specially designed for Affymetrix type arrays (or other multiple-probe arrays). Instead of making inference on probe-set summaries, comparisons are made separately for each perfect-match probe and are then summarized into one score for the probe-set. Conclusion The proposed methods are compared to 14 existing methods using five spike-in data sets. For RMA and GCRMA processed data, PLW has the most accurate ranking of regulated genes in four out of the five data sets, and LMW consistently performs better than all examined moderated t-tests when used on RMA, GCRMA, and MAS5 expression indexes.</p

Springer

Chalmers Publication Library

Chalmers Research

An expression meta-analysis of predicted microRNA targets identifies a diagnostic signature for lung cancer

Author: A Bhattacharjee
A Clegg
A Gaur
A Potti
A Spira
AC Borczuk
B Angulo
D Moher
DG Beer
DM Parkin
DN Hayes
FV Karginov
G Anumanthan
GT Bommer
HY Lee
J Lu
J Takamizawa
JB Axelsen
JE Larsen
JE Larsen
KK Dutta
L He
L Johnson
LP Lim
M Raponi
MA van der Drift
ME Garber
MH Jones
N Yanaihara
P Sethupathy
PB Bach
R Tibshirani
RS Stearman
S Itoh
S Ramalingam
S Wachi
S Wang
SL Yu
X Ge
Y Liang
Y Lu
Yu Liang
Publication venue: BioMed Central
Publication date: 01/12/2008
Field of study

Abstract Background Patients diagnosed with lung adenocarcinoma (AD) and squamous cell carcinoma (SCC), two major histologic subtypes of lung cancer, currently receive similar standard treatments, but resistance to adjuvant chemotherapy is prevalent. Identification of differentially expressed genes marking AD and SCC may prove to be of diagnostic value and help unravel molecular basis of their histogenesis and biologies, and deliver more effective and specific systemic therapy. Methods MiRNA target genes were predicted by union of miRanda, TargetScan, and PicTar, followed by screening for matched gene symbols in NCBI human sequences and Gene Ontology (GO) terms using the PANTHER database that was also used for analyzing the significance of biological processes and pathways within each ontology term. Microarray data were extracted from Gene Expression Omnibus repository, and tumor subtype prediction by gene expression used Prediction Analysis of Microarrays. Results Computationally predicted target genes of three microRNAs, miR-34b/34c/449, that were detected in human lung, testis, and fallopian tubes but not in other normal tissues, were filtered by representation of GO terms and their ability to classify lung cancer subtypes, followed by a meta-analysis of microarray data to classify AD and SCC. Expression of a minimal set of 17 predicted miR-34b/34c/449 target genes derived from the developmental process GO category was identified from a training set to classify 41 AD and 17 SCC, and correctly predicted in average 87% of 354 AD and 82% of 282 SCC specimens from total 9 independent published datasets. The accuracy of prediction still remains comparable when classifying 103 AD and 79 SCC samples from another 4 published datasets that have only 14 to 16 of the 17 genes available for prediction (84% and 85% for AD and SCC, respectively). Expression of this signature in two published datasets of epithelial cells obtained at bronchoscopy from cigarette smokers, if combined with cytopathology of the cells, yielded 89–90% sensitivity of lung cancer detection and 87–90% negative predictive value to non-cancer patients. Conclusion This study focuses on predicted targets of three lung-enriched miRNAs, compares their expression patterns in lung cancer by their GO terms, and identifies a minimal set of genes differentially expressed in AD and SCC, followed by validating this gene signature in multiple published datasets. Expression of this gene signature in bronchial epithelial cells of cigarette smokers also has a great sensitivity to predict the patients having lung cancer if combined with cytopathology of the cells.</p

Discovering collectively informative descriptors from high-throughput experiments

Author: A Bhattacharjee
A Golbraikh
A Hess
A Sadanandam
A Tropsha
AJ Sutton
AN Pronin
B Millauer
B Singh
BA Jensen
CA Powell
Clark D Jeffries
CM Findley
DA Smirnov
DG Beer
Diana O Perkins
DR Cox
DR Rhodes
EP Kopantzev
Fred A Wright
GC Chang
HS Soifer
J Kazius
J Lamb
J Zar
JS Nam
L Cronbach
L He
M Blangiardo
M Selbach
P Greenwood
P Westfall
R Breitling
R Breitling
R Edgar
R Li
RS Stearman
T Barrett
William O Ward
YH Soung
Ø Langsrud
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Improvements in high-throughput technology and its increasing use have led to the generation of many highly complex datasets that often address similar biological questions. Combining information from these studies can increase the reliability and generalizability of results and also yield new insights that guide future research. Results This paper describes a novel algorithm called BLANKET for symmetric analysis of two experiments that assess informativeness of descriptors. The experiments are required to be related only in that their descriptor sets intersect substantially and their definitions of case and control are consistent. From resulting lists of n descriptors ranked by informativeness, BLANKET determines shortlists of descriptors from each experiment, generally of different lengths p and q. For any pair of shortlists, four numbers are evident: the number of descriptors appearing in both shortlists, in exactly one shortlist, or in neither shortlist. From the associated contingency table, BLANKET computes Right Fisher Exact Test (RFET) values used as scores over a plane of possible pairs of shortlist lengths <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. BLANKET then chooses a pair or pairs with RFET score less than a threshold; the threshold depends upon n and shortlist length limits and represents a quality of intersection achieved by less than 5% of random lists. Conclusions Researchers seek within a universe of descriptors some minimal subset that collectively and efficiently predicts experimental outcomes. Ideally, any smaller subset should be insufficient for reliable prediction and any larger subset should have little additional accuracy. As a method, BLANKET is easy to conceptualize and presents only moderate computational complexity. Many existing databases could be mined using BLANKET to suggest optimal sets of predictive descriptors.</p

Carolina Digital Repository

Immuno-Therapy with Anti-CTLA4 Antibodies in Tolerized and Non-Tolerized Mouse Tumor Models

Author: A Teige
A van Elsas
AA Hurwitz
AH Klopp
André Lieber
B Kavanagh
CL Li
D Stone
DI Godfrey
DR Leach
DW Emery
Eric J. Kremer
H Lu
Hans-Peter Kiem
I Beyer
I Beyer
Ines Beyer
J Diao
J Shimizu
J Shimizu
J Szulc
JG Egen
Jonas Persson
K Ko
KL Knutson
KL Knutson
KL Knutson
LH Camacho
M Aker
M De Palma
M Terabe
MD Griffin
MO Li
P Attia
P Malik
R Strauss
R Strauss
Roma Yumul
RS Stearman
S Hegde
S Khan
S Read
S Tuve
S Tuve
SA Quezada
SJ Shieh
SS Agarwala
Steve Roffler
T Yamaguchi
Z Li
ZongYi Li
Publication venue: Public Library of Science
Publication date: 14/07/2011
Field of study

Monoclonal antibodies specific for cytotoxic T lymphocyte-associated antigen 4 (anti-CTLA4) are a novel form of cancer immunotherapy. While preclinical studies in mouse tumor models have shown anti-tumor efficacy of anti-CTLA4 injection or expression, anti-CTLA4 treatment in patients with advanced cancers had disappointing therapeutic benefit. These discrepancies have to be addressed in more adequate pre-clinical models. We employed two tumor models. The first model is based on C57Bl/6 mice and syngeneic TC-1 tumors expressing HPV16 E6/E7. In this model, the HPV antigens are neo-antigens, against which no central tolerance exists. The second model involves mice transgenic for the proto-oncogen neu and syngeneic mouse mammary carcinoma (MMC) cells. In this model tolerance to Neu involves both central and peripheral mechanisms. Anti-CTLA4 delivery as a protein or expression from gene-modified tumor cells were therapeutically efficacious in the non-tolerized TC-1 tumor model, but had no effect in the MMC-model. We also used the two tumor models to test an immuno-gene therapy approach for anti-CTLA4. Recently, we used an approach based on hematopoietic stem cells (HSC) to deliver the relaxin gene to tumors and showed that this approach facilitates pre-existing anti-tumor T-cells to control tumor growth in the MMC tumor model. However, unexpectedly, when used for anti-CTLA4 gene delivery in this study, the HSC-based approach was therapeutically detrimental in both the TC-1 and MMC models. Anti-CTLA4 expression in these models resulted in an increase in the number of intratumoral CD1d+ NKT cells and in the expression of TGF-β1. At the same time, levels of pro-inflammatory cytokines and chemokines, which potentially can support anti-tumor T-cell responses, were lower in tumors of mice that received anti-CTLA4-HSC therapy. The differences in outcomes between the tolerized and non-tolerized models also provide a potential explanation for the low efficacy of CTLA4 blockage approaches in cancer immunotherapy trials

Public Library of Science (PLOS)

Large-scale integration of cancer microarray data identifies a robust common cancer signature

Author: A Bhattacharjee
A Cromer
AC Tan
AI Su
AI Su
BJ Quade
CA Iacobuzio-Donahue
CD Logsdon
CF Basil
D Geman
D Talantov
DG Beer
DH Gutmann
Donald Geman
DR Rhodes
DR Rhodes
DS Rickman
E Dehan
E Segal
F Zhan
GJ Gordon
HF Frierson Jr.
I Yanai
J Luo
JB Welsh
JM Lancaster
JPT Higgins
L Dyrskjot
L Liotta
L Xu
Lei Xu
LL Hsiao
M Bittner
M Lenburg
MA Watson
ND Price
P Pavlidis
PJ Hoffman
R Shai
Raimond L Winslow
RC Bast Jr.
RS Stearman
S Michiels
S Ramaswamy
S Wachi
S Welle
SL Pomeroy
SM Dhanasekaran
SS Yoon
T Barrett
T Yagi
TJ Giordano
TR Golub
X Chen
X Yang
Y Hippo
Y Huang
YP Yu
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background There is a continuing need to develop molecular diagnostic tools which complement histopathologic examination to increase the accuracy of cancer diagnosis. DNA microarrays provide a means for measuring gene expression signatures which can then be used as components of genomic-based diagnostic tests to determine the presence of cancer. Results In this study, we collect and integrate ~ 1500 microarray gene expression profiles from 26 published cancer data sets across 21 major human cancer types. We then apply a statistical method, referred to as the <it>T</it>op-<it>S</it>coring <it>P</it>air of <it>G</it>roups (TSPG) classifier, and a repeated random sampling strategy to the integrated training data sets and identify a common cancer signature consisting of 46 genes. These 46 genes are naturally divided into two distinct groups; those in one group are typically expressed less than those in the other group for cancer tissues. Given a new expression profile, the classifier discriminates cancer from normal tissues by ranking the expression values of the 46 genes in the cancer signature and comparing the average ranks of the two groups. This signature is then validated by applying this decision rule to independent test data. Conclusion By combining the TSPG method and repeated random sampling, a robust common cancer signature has been identified from large-scale microarray data integration. Upon further validation, this signature may be useful as a robust and objective diagnostic test for cancer.</p

Protein Signature of Lung Cancer Tissues

Author: A Jemal
A Taguchi
A Yuan
AE Carpenter
B Bartling
BR Wegmann
C Robert
C Yfanti
CJ Moran
CW Seder
D Hanahan
D Liu
D Olchovsky
Deborah Ayers
Derek Thirstrup
DG Yoo
DR Senger
Edward N. Brody
EP Diamandis
ES Kassis
G Deeb
G Fontanini
G Fontanini
Geoffrey S. Baird
H Chang
H Imoto
H Nakamura
H-S Hofmann
H-S Hofmann
HM Cangara
J Chung-man Ho
J Safranek
J Su
J Yoo
JD Storey
Jeffrey J. Walker
JPA Ioannidis
JW Son
L Gold
Larry Gold
LV Sequist
M Boeri
M Okada
M Salden
M Takada
Michael R. Mehan
MJ Carlini
MM Krady
Nebojsa Janjic
P Allavena
P Bornstein
P Schraml
PJ Simpson-Haidaris
PMM Bossuyt
R Jing
Rachel M. Ostroff
RL Yauch
RM Bremnes
RM Ostroff
Rossella Rota
RS Herbst
RS Stearman
S Gupta
S Singhal
SA Shah
Sheri K. Wilcox
T Chijiwa
T Iizasa
T Migita
T-M Kim
Thale C. Jarvis
TR Jones
Wei Xiong
Y Chen
Y Chen
Y Liu
Y Ohta
Y Oshika
Y-C Lee
Y-J Cheng
Z Zhang
ZJ Chen
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Lung cancer remains the most common cause of cancer-related mortality. We applied a highly multiplexed proteomic technology (SOMAscan) to compare protein expression signatures of non small-cell lung cancer (NSCLC) tissues with healthy adjacent and distant tissues from surgical resections. In this first report of SOMAscan applied to tissues, we highlight 36 proteins that exhibit the largest expression differences between matched tumor and non-tumor tissues. The concentrations of twenty proteins increased and sixteen decreased in tumor tissue, thirteen of which are novel for NSCLC. NSCLC tissue biomarkers identified here overlap with a core set identified in a large serum-based NSCLC study with SOMAscan. We show that large-scale comparative analysis of protein expression can be used to develop novel histochemical probes. As expected, relative differences in protein expression are greater in tissues than in serum. The combined results from tissue and serum present the most extensive view to date of the complex changes in NSCLC protein expression and provide important implications for diagnosis and treatment

CiteSeerX

Public Library of Science (PLOS)

FigShare

Twist1 Suppresses Senescence Programs and Thereby Accelerates and Maintains Mutant Kras-Induced Lung Tumorigenesis

Author: A Bhattacharjee
A Cromer
A Jemal
A Subramanian
A Ventura
A Vichalkovski
AK Perl
Alejandro Sweet-Cordero
AY Nikitin
BE Johnson
C Nardella
C Scholl
Carsten H. Nielsen
CH Nielsen
CH Wu
Charles M. Rudin
CJ Sarkisian
D Li
D Pan
DA Barbie
DA Eberhard
DA Tuveson
Dean W. Felsher
DG Beer
DM Feldser
EL Jackson
Emelyn H. Shroff
F Janku
GH Fisher
GL Verdine
H Ji
H. Leighton Grimes
HK Lin
IA Stasinopoulos
J Luo
J Yang
JF Mahler
Joy Chen
JW Jang
K Hoek
K Ohuchida
K Politi
K Theilgaard-Monch
KE Lee
Khaled Aziz
KY Sarin
L Ding
L Regales
L Soucek
LJ Su
M Collado
M Jechlinger
M Puyol
M Serrano
M Shiota
MA Smit
MH Yang
MR Junttila
MT Landi
N Entz-Werle
Nadia Withofs
NP Young
Pablo Tamayo
Phuoc T. Tran
PT Tran
R Loew
R Maestro
RA Mesa
Richard Luong
RS Stearman
S Ansieau
S Valsesia-Wittmann
S Wachi
Sandhya T. Das
Sanjiv S. Gambhir
Saravanan Thiyagarajan
SG Talbot
Silvestre Vicent
Stacey J. Adam
Tahera Zabuawala
Tarek Salih
Timothy F. Burns
W Pao
W Xue
WK Kwok
WK Kwok
Y Jiang
Y Mironchik
Yoon-Jae Cho
Z Zhang
ZF Chen
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

KRAS mutant lung cancers are generally refractory to chemotherapy as well targeted agents. To date, the identification of drugs to therapeutically inhibit K-RAS have been unsuccessful, suggesting that other approaches are required. We demonstrate in both a novel transgenic mutant Kras lung cancer mouse model and in human lung tumors that the inhibition of Twist1 restores a senescence program inducing the loss of a neoplastic phenotype. The Twist1 gene encodes for a transcription factor that is essential during embryogenesis. Twist1 has been suggested to play an important role during tumor progression. However, there is no in vivo evidence that Twist1 plays a role in autochthonous tumorigenesis. Through two novel transgenic mouse models, we show that Twist1 cooperates with KrasG12D to markedly accelerate lung tumorigenesis by abrogating cellular senescence programs and promoting the progression from benign adenomas to adenocarcinomas. Moreover, the suppression of Twist1 to physiological levels is sufficient to cause Kras mutant lung tumors to undergo senescence and lose their neoplastic features. Finally, we analyzed more than 500 human tumors to demonstrate that TWIST1 is frequently overexpressed in primary human lung tumors. The suppression of TWIST1 in human lung cancer cells also induced cellular senescence. Hence, TWIST1 is a critical regulator of cellular senescence programs, and the suppression of TWIST1 in human tumors may be an effective example of pro-senescence therapy

Copenhagen University Research Information System

Open Repository and Bibliography - Liège

FigShare

Meta-analysis of muscle transcriptome data using the MADMuscle database reveals biologically relevant gene patterns

Author: A Dubrovsky
A Kuhn
AI Su
AJ Holloway
AJ Wagers
Armelle Magot
Audrey Bihouée
BR Zeeberg
BS Tseng
C Romualdi
C Thieblemont
C Workman
D Baron
D Baron
D Baron
D Baron
D Baron
D Baron
D Baron
D Ghosh
D Mirebeau-Prunier
Daniel Baron
DJ Lockhart
DN Grigoryev
DR Rhodes
DR Rhodes
DR Rhodes
DR Rhodes
E Calura
E Segal
E Segal
Emeric Dubois
EP Hoffman
EW Forgy
F Chalmel
F Pan
Frédérique Savagner
G Lamirault
G Parmigiani
Gérard Ramstein
H Fang
HK Lee
HM Wain
I Leguen
J Chen
J Lamb
J Wang
JC Newman
JC Newman
JE Larkin
JF Fontaine
JK Choi
JK Choi
JM Stuart
JN Haslett
JN Haslett
JN Haslett
K De Preter
K Wennmalm
KJ Mitchell
M Ashburner
M Bakay
M Pescatori
M Schena
Marja Steenman
MB Eisen
MJ de Hoon
O Larsson
O Larsson
O Troyanskaya
P Cahan
Philippe Jourdon
PJ Rousseeuw
PK Tan
R Chen
R Edgar
R Ihaka
R Jelier
R Mehra
RA Irizarry
Raluca Teusan
Reiner Veitia
RG Jenner
RS Stearman
Rémi Houlgatte
S Ramaswamy
S Tavazoie
SA McCarroll
TE Bertorini
TF Cox
TR Hughes
V Detours
WP Kuo
XJ Zhou
Y Moreau
Y Yi
Yann Péréon
YH Yang
YW Chen
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background DNA microarray technology has had a great impact on muscle research and microarray gene expression data has been widely used to identify gene signatures characteristic of the studied conditions. With the rapid accumulation of muscle microarray data, it is of great interest to understand how to compare and combine data across multiple studies. Meta-analysis of transcriptome data is a valuable method to achieve it. It enables to highlight conserved gene signatures between multiple independent studies. However, using it is made difficult by the diversity of the available data: different microarray platforms, different gene nomenclature, different species studied, etc. Description We have developed a system tool dedicated to muscle transcriptome data. This system comprises a collection of microarray data as well as a query tool. This latter allows the user to extract similar clusters of co-expressed genes from the database, using an input gene list. Common and relevant gene signatures can thus be searched more easily. The dedicated database consists in a large compendium of public data (more than 500 data sets) related to muscle (skeletal and heart). These studies included seven different animal species from invertebrates (<it>Drosophila melanogaster, Caenorhabditis elegans</it>) and vertebrates (<it>Homo sapiens, Mus musculus, Rattus norvegicus, Canis familiaris, Gallus gallus</it>). After a renormalization step, clusters of co-expressed genes were identified in each dataset. The lists of co-expressed genes were annotated using a unified re-annotation procedure. These gene lists were compared to find significant overlaps between studies. Conclusions Applied to this large compendium of data sets, meta-analyses demonstrated that conserved patterns between species could be identified. Focusing on a specific pathology (Duchenne Muscular Dystrophy) we validated results across independent studies and revealed robust biomarkers and new pathways of interest. The meta-analyses performed with MADMuscle show the usefulness of this approach. Our method can be applied to all public transcriptome data.</p